CDS

Accession Number TCMCG074C20238
gbkey CDS
Protein Id KAF8401845.1
Location complement(join(13101670..13102574,13107918..13108013,13112027..13112186,13112279..13112335,13113044..13113191,13113288..13113295,13116429..13116464))
Organism Tetracentron sinense
locus_tag HHK36_012792

Protein

Length 469aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA625382, BioSample:SAMN14615867
db_source JABCRI010000008.1
Definition hypothetical protein HHK36_012792 [Tetracentron sinense]
Locus_tag HHK36_012792

EGGNOG-MAPPER Annotation

COG_category K
Description Transcription factor that specifically binds AT-rich DNA sequences related to the nuclear matrix attachment regions (MARs)
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0002682        [VIEW IN EMBL-EBI]
GO:0002683        [VIEW IN EMBL-EBI]
GO:0003674        [VIEW IN EMBL-EBI]
GO:0003676        [VIEW IN EMBL-EBI]
GO:0003677        [VIEW IN EMBL-EBI]
GO:0003680        [VIEW IN EMBL-EBI]
GO:0003690        [VIEW IN EMBL-EBI]
GO:0003700        [VIEW IN EMBL-EBI]
GO:0005488        [VIEW IN EMBL-EBI]
GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005634        [VIEW IN EMBL-EBI]
GO:0006355        [VIEW IN EMBL-EBI]
GO:0006950        [VIEW IN EMBL-EBI]
GO:0006952        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0009605        [VIEW IN EMBL-EBI]
GO:0009607        [VIEW IN EMBL-EBI]
GO:0009620        [VIEW IN EMBL-EBI]
GO:0009889        [VIEW IN EMBL-EBI]
GO:0010468        [VIEW IN EMBL-EBI]
GO:0010556        [VIEW IN EMBL-EBI]
GO:0019219        [VIEW IN EMBL-EBI]
GO:0019222        [VIEW IN EMBL-EBI]
GO:0031323        [VIEW IN EMBL-EBI]
GO:0031326        [VIEW IN EMBL-EBI]
GO:0031347        [VIEW IN EMBL-EBI]
GO:0031348        [VIEW IN EMBL-EBI]
GO:0043207        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0043565        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0045088        [VIEW IN EMBL-EBI]
GO:0045824        [VIEW IN EMBL-EBI]
GO:0048519        [VIEW IN EMBL-EBI]
GO:0048583        [VIEW IN EMBL-EBI]
GO:0048585        [VIEW IN EMBL-EBI]
GO:0050776        [VIEW IN EMBL-EBI]
GO:0050777        [VIEW IN EMBL-EBI]
GO:0050789        [VIEW IN EMBL-EBI]
GO:0050794        [VIEW IN EMBL-EBI]
GO:0050832        [VIEW IN EMBL-EBI]
GO:0050896        [VIEW IN EMBL-EBI]
GO:0051171        [VIEW IN EMBL-EBI]
GO:0051252        [VIEW IN EMBL-EBI]
GO:0051704        [VIEW IN EMBL-EBI]
GO:0051707        [VIEW IN EMBL-EBI]
GO:0060255        [VIEW IN EMBL-EBI]
GO:0065007        [VIEW IN EMBL-EBI]
GO:0080090        [VIEW IN EMBL-EBI]
GO:0080134        [VIEW IN EMBL-EBI]
GO:0097159        [VIEW IN EMBL-EBI]
GO:0098542        [VIEW IN EMBL-EBI]
GO:0140110        [VIEW IN EMBL-EBI]
GO:1901363        [VIEW IN EMBL-EBI]
GO:1903506        [VIEW IN EMBL-EBI]
GO:1990837        [VIEW IN EMBL-EBI]
GO:2000112        [VIEW IN EMBL-EBI]
GO:2001141        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGATACCTTTGTTACGGGAGTGGAAGGCCTGGAAACGTTCCTGGAGTACCGTGGCAATAGAGCCGCACTCTTTGATGGCATTGAGGAGGGTGGTATCAGGGCTTCCTCATCTTACTCCCATGAAATTGATGAGCATGACAATGACAGTGCTATTGATGGACTTCAAGATAGAGTCAACCTACTGAAGAGATTGTCAGGTGATATACACGAGGAGGTGGAGACTCATAACAGCGTGCTGGACCAAATGGGCAACAATATGGATTCATCAAGGGGAATCTTGTCGGGAACTATGGATCGGTTCAAGATGGTAAGTAAACCAAATTCCGTTAATTCTTTTGCGATCAGTTTTCTTCTTTTTACAGCTGCTATCCCGTTTGCCGATTGTAACAAAGTTGTTACTACTGTAGAACTCCTCGACAGTGGAGAAAGTGTGGATGAGGCGAACGGAACTGCCCTTGCAGGCAGCCTGCTTGGATTTGGCGTAGGGATTGTGGAACCAATAGCTTGTGTACGTTCTCCTGTTTTTCTACCTTTGTTTTTCCAATTCGCAAAACTGGCGAACCGGTGGTGGACCGAGCCGTTGGGGCCACCGGGAATCGGACCAGCAGCTGGTTCACTTGTCATGAAGAAACGAGACCGGGAAACATCGATCAACGACAACGGAGGAAGCAGCAGCGGTGGAAGAGACGACGAAGAAGAAAGAGAGAACGGAGATGAGACCAAAGATGGTGCAGTCGAGGTCGTGAATCGTCGACCTCGAGGCCGGCCATCGGGGTCCAAGAACAAGCCCAAGCCACCAATCTTTGTAACCAGAGACAGCCCAAACGCCCTCCGGAGCCACGTCATGGAGGTCGCCGCCGGCGCCGATATTGTGGAAAGCGTAGCCCAGTTCGCCAGAAGGCGGCAGCGAGGAGTCTGTGTACTTAGCGGGAGCGGAGCAGTGGCTAACGTCACACTCCGGCAGCCAGCTTCATCGGGAGCTGTTGTGGCACTGCACGGAAGGTTCGAGATACTGTCATTAACTGGAGCATTCCTTCCCGGACCAGCCCCGCCGGGCTCGACCGGACTGACGGTTTACGTAGCGGGTGGTCCGGGTCAGGTGTTGGGTGGCATTGTGGTTGGTACGCTTGTTGCAGCTGGGCCGGTCATGGTGATTGCAGCAATATTTGCTAACGCGACATTCGAGCGGCTTCCGCTCGAAGAAGAAGATGATGATGCAGGTGGTGGGCAGCTTCCGGGCGGTGCTGGAAGCTCACCACCTGCAATTGGGCAGCAGCAGCAGGCTGCGCTGCCGCCCGACCCATCATCCTCGTTGCCGGTTTACAATGTGCCGCCAAATCTATTCCATACTGGTGGTGGGATGAACAACGATGCCTATGCTTGGGCTCATTCTCGCTCACCTTATTGA
Protein:  
MDTFVTGVEGLETFLEYRGNRAALFDGIEEGGIRASSSYSHEIDEHDNDSAIDGLQDRVNLLKRLSGDIHEEVETHNSVLDQMGNNMDSSRGILSGTMDRFKMVSKPNSVNSFAISFLLFTAAIPFADCNKVVTTVELLDSGESVDEANGTALAGSLLGFGVGIVEPIACVRSPVFLPLFFQFAKLANRWWTEPLGPPGIGPAAGSLVMKKRDRETSINDNGGSSSGGRDDEEERENGDETKDGAVEVVNRRPRGRPSGSKNKPKPPIFVTRDSPNALRSHVMEVAAGADIVESVAQFARRRQRGVCVLSGSGAVANVTLRQPASSGAVVALHGRFEILSLTGAFLPGPAPPGSTGLTVYVAGGPGQVLGGIVVGTLVAAGPVMVIAAIFANATFERLPLEEEDDDAGGGQLPGGAGSSPPAIGQQQQAALPPDPSSSLPVYNVPPNLFHTGGGMNNDAYAWAHSRSPY